Add ignore_eos option to GenerationConfig #16912

navsud · 2026-01-27T19:38:17Z

Summary: Allow LLM generation to continue until max_new_tokens is reached, even after encountering EOS tokens. This is useful for benchmarking and testing scenarios where complete token generation is desired regardless of natural stopping points.

Reviewed By: kimishpatel

Differential Revision: D91183072

pytorch-bot · 2026-01-27T19:38:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16912

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 5 New Failures

As of commit 69e3d17 with merge base 3914a7e ():

NEW FAILURES - The following jobs have failed:

pull / android / run-emulator (gh)
The process '/usr/bin/sh' failed with exit code 255
pull / test-llama-runner-qnn-linux (fp32, qnn_16a16w, qnn) / linux-job (gh)
/pytorch/executorch/examples/qualcomm/oss_scripts/llama/qnn_multimodal_runner.cpp:307:7: error: type 'float' cannot be narrowed to 'int32_t' (aka 'int') in initializer list [-Wc++11-narrowing]
pull / test-llama-runner-qnn-linux (fp32, qnn_8a8w, qnn) / linux-job (gh)
/pytorch/executorch/examples/qualcomm/oss_scripts/llama/qnn_multimodal_runner.cpp:307:7: error: type 'float' cannot be narrowed to 'int32_t' (aka 'int') in initializer list [-Wc++11-narrowing]
pull / test-static-llama-qnn-linux (stories_110m) / linux-job (gh)
/pytorch/executorch/examples/qualcomm/oss_scripts/llama/qnn_multimodal_runner.cpp:307:7: error: type 'float' cannot be narrowed to 'int32_t' (aka 'int') in initializer list [-Wc++11-narrowing]
pull / test-static-llama-qnn-linux (stories_260k_bc) / linux-job (gh)
/pytorch/executorch/examples/qualcomm/oss_scripts/llama/qnn_multimodal_runner.cpp:307:7: error: type 'float' cannot be narrowed to 'int32_t' (aka 'int') in initializer list [-Wc++11-narrowing]

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2026-01-27T19:38:24Z

@navsud has exported this pull request. If you are a Meta employee, you can view the originating Diff in D91183072.

kimishpatel

Review automatically exported from Phabricator review in Meta.

github-actions · 2026-01-27T19:39:02Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Summary: Allow LLM generation to continue until max_new_tokens is reached, even after encountering EOS tokens. This is useful for benchmarking and testing scenarios where complete token generation is desired regardless of natural stopping points. Reviewed By: kimishpatel Differential Revision: D91183072

navsud requested review from larryliu0820 and mergennachin as code owners January 27, 2026 19:38

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 27, 2026

meta-codesync bot added fb-exported meta-exported labels Jan 27, 2026

kimishpatel approved these changes Jan 27, 2026

View reviewed changes

navsud force-pushed the export-D91183072 branch from 01fbe42 to 69e3d17 Compare January 28, 2026 17:24

navsud requested a review from lucylq as a code owner January 28, 2026 17:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ignore_eos option to GenerationConfig #16912

Add ignore_eos option to GenerationConfig #16912

navsud commented Jan 27, 2026

Uh oh!

pytorch-bot bot commented Jan 27, 2026 •

edited

Loading

Uh oh!

meta-codesync bot commented Jan 27, 2026

Uh oh!

kimishpatel left a comment

Uh oh!

github-actions bot commented Jan 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add ignore_eos option to GenerationConfig #16912

Are you sure you want to change the base?

Add ignore_eos option to GenerationConfig #16912

Conversation

navsud commented Jan 27, 2026

Uh oh!

pytorch-bot bot commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16912

❌ 5 New Failures

Uh oh!

meta-codesync bot commented Jan 27, 2026

Uh oh!

kimishpatel left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jan 27, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Jan 27, 2026 •

edited

Loading

This PR needs a `release notes:` label